Data Visualization With Multidimensional Scaling
نویسندگان
چکیده
We discuss methodology for multidimensional scaling (MDS) and its implementation in two software systems, GGvis and XGvis. MDS is a visualization technique for proximity data, that is, data in the form of N × N dissimilarity matrices. MDS constructs maps (“configurations,” “embeddings”) in IRk by interpreting the dissimilarities as distances. Two frequent sources of dissimilarities are high-dimensional data and graphs. When the dissimilarities are distances between high-dimensional objects, MDS acts as a (often nonlinear) dimension-reduction technique. When the dissimilarities are shortest-path distances in a graph, MDS acts as a graph layout technique. MDS has found recent attention in machine learning motivated by image databases (“Isomap”). MDS is also of interest in view of the popularity of “kernelizing” approaches inspired by Support Vector Machines (SVMs; “kernel PCA”). This article discusses the following general topics: (1) the stability and multiplicity of MDS solutions; (2) the analysis of structure within and between subsets of objects with missing value schemes in dissimilarity matrices; (3) gradient descent for optimizing general MDS loss functions (“Strain” and “Stress”); (4) a unification of classical (Strain-based) and distance (Stress-based) MDS. Particular topics include the following: (1) blending of automatic optimization with interactive displacement of configuration points to assist in the search for global optima; (2) forming groups of objects with interactive brushing to create patterned missing values in MDS loss functions; (3) optimizing MDS loss functions for large numbers of objects relative to a small set of anchor points (“external unfolding”); and (4) a nonmetric version of classical MDS.
منابع مشابه
Multidimensional Scaling for Evolutionary Algorithms - Visualization of the Path through Search Space and Solution Space Using Sammon Mapping
Multidimensional scaling as a technique for the presentation of high-dimensional data with standard visualization techniques is presented. The technique used is often known as Sammon mapping. We explain the mathematical foundations of multidimensional scaling and its robust calculation. We also demonstrate the use of this technique in the area of evolutionary algorithms. First, we present the v...
متن کاملA Partially Supervised Metric Multidimensional Scaling Algorithm for Textual Data Visualization
متن کامل
Scalable Dimension Reduction for Large Abstract Data Visualization
The ability to browse vast amounts of scientific data is critical to facilitate science discovery. High performance Multidimensional Scaling (MDS) algorithm makes it a reality by reducing dimensions so that scientists can gain insight into data set from a 3D visualization space. As multidimensional scaling requires quadratics order of physical memory and computation, a major challenge is to des...
متن کاملProxiViz: an Interactive Visualization Technique to Overcome Multidimensional Scaling Artifacts
Projection algorithms such as multidimensional scaling are often used to visualize high-dimensional data. However, when attempting to interpret the visualization of the resulting 2D projection, users are faced with artifacts. This poster introduces ProxiViz: an interactive technique to provide better insights about the original data-space. Primary results of a controlled experiment show that Pr...
متن کاملVisualization Methodology for Multidimensional Scaling
We describe methodology for multidimensional scaling based on interactive data visualization. This methodology was enabled by software in which MDS is integrated in a multivariate data visualization system. The software, called “XGvis”, is described in a companion paper (Buja, Swayne, Littman, Dean and Hofmann 2001), that lays out the implemented functionality in some detail; in the current pap...
متن کاملNew Developments of Nonlinear Projections for the Visualization of Structures in Nonvectorial Data Sets
Aalto University, P.O. Box 11000, FI-00076 Aalto www.aalto.fi Author Teuvo Kohonen Name of the publication New Developments of Nonlinear Projections for the Visualization of Structures in Nonvectorial Data Sets Publisher School of Science Unit Department of Information and Computer Science Series Aalto University publication series SCIENCE + TECHNOLOGY 8/2011 Field of research Computer science ...
متن کامل